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1 MULTES/IS: an effective and reliable test generation system for 
@) partial scan and non-scan synchronous circuits 
T. Ogihara , K. Muroi , G. Yonemori , S. Murai 
Proceedings of the 1989 26th ACM/IEEE conference on Design 
automation conference June 1989 

This paper describes an automatic test generation system which 
effectively generates test vectors by recognizing the circuit blocks for 
which vectors are automatically generated and the circuit blocks for 
which vectors have to be manually prepared. Test vectors for full 
scan, partial scan and nonscan synchronous circuit blocks are 
automatically generated. Test vectors for asynchronous circuit blocks 
have to be manually prepared. 
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2 HAL: A multi-paradigm approach to automatic data path synthesis 77% 
Si P. G. Paulin , J. P. Knight , E. F. Girczyc 

Papers on Twenty-five years of electronic design automation June 1988 
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K. Zhou , M. Chu , C. You , J.-R. Guo , J.-R. Guo , J. Mayega , B. S. 
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Proceedings of the international symposium on Field programmable gate 
arrays February 2003 
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The low operating speed of current CMOS Field Programmable Gate 
Arrays (FPGAs), i.e., 10-220 MHz, has prevented their use in 
high-speed digital applications. With the advent of IBM Silicon 
Germanium (SiGe) 7HP technology, designers have been able to 
design FPGAs operating in the gigahertz range. This paper is going to 
elaborate on the implementation of a 4-bit ripple-carry full adder (FA) 
on the new SiGe FPGA with new architectures and a novel power 
management strategy. The 1-bit FA can be reali ... 

4 Poster session: A high resolution diagnosis technique for open and 77% 
@] short defects in FPGA interconnects 

Mehdi Baradaran Tahoori 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

A two-step diagnosis flow, coarse-grain and fine-grain, is presented 
in order to identify a faulty element in the FPGA interconnects. The 
fault models used for interconnect are open, resistive-open, and 
bridging fault. The coarse-grain phase identifies the faulty net, the 
routing between two consecutive sequential elements in the FPGA. 
This phase is performed by just post-processing tester results for the 
test configurations used for interconnect testing. During the 
fine-grain step, the faulty n ... 

5 Poster session: Application-dependent testing of FPGAs for 77% 
13 bridging faults 

Mehdi Baradaran Tahoori 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

A new technique is presented for testing for bridging faults in the 

... interconnects-of an arbitrary design implemented in an FPGA. The 

configuration of the routing resources used in the original design 
remains .unchanged in the test configurations. Only the logic blocks 
used in the design are reprogrammed in order to implement 
single-term functions, logic functions with only one minterm or one 
maxterm. As shown by formal proofs, all activated faults are detected 
when single-term functions and appro ... 

6 Poster session: A physical retiming algorithm for field 77% 
13 programmable gate arrays 

Peter Suaris , Dongsheng Wang , Pei-Ning Guo , Nan-Chi Chou 
Proceedings of the international symposium on Field programmable gate 
arrays February 2003 

In this paper, we present a physical retiming algorithm for sequential 
circuits implemented in field programmable gate arrays (FPGAs). This 
algorithm can speed up the sequential circuits by reducing delay of all 
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critical paths with negative slacks. By taking advantage of the 
physical information provided by placed circuits, this algorithm 
integrates two operations: retiming and register duplication. Retiming 
moves registers across combinational components. Register 
duplication moves registers ac ... 

7 Poster session: Design strategies and modified descriptions to 77% 
31 optimize cipher FPGA implementations: fast and compact results 

for DES and triple-DES 

Gael Rouvroy , Francois-Xavier Standaert , Jean-Jacques Quisquater , 
Jean-Didier Legat 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

We propose a new mathematical DES description that allows 
optimized implementations. It also provides the best DES and 
triple-DES FPGA implementations known in term of ratio 
throughput/area, where area means the number of FPGA slices used. 
First, we get a less resource consuming unrolled DES implementation 
that works at data rates of 21.3 Gbps (333 MHz), using VIRTEX II 
technology. In this design, the plaintext, the key and the mode 
(encryption/decrytion) can be changed on a cycle-by-cycle basis ... 

8 Poster session: Wireless sensor networks: a power-scalable motion 77% 
3) estimation IP for hybrid video coding 

Federico Quaglio , Maurzio Martina , Fabrizio Vacca , Guido Masera , 

Andrea Molino , Gianluca Piccinini , Maurizio Zamboni 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Wireless Sensor Networks are an emerging phenomenon in the 

. research community. The design and development of network" 

architectures and nodes implementation are fostering many research 
activities. Due to their wide application fields and pervasive 
employment possibilities, the investigation of novel classes of 
wireless sensor nodes is of great concern. In this paper we presented 
a novel Power-Scalable Motion Estimation IP suitable for 
video-surveillance over Wireless Sensor Networks. The proposed ... 

9 Poster session: Lattice adaptive filter implementation for FPGA 77% 
3) Zdenek Pohl , Rudolf Matousek , Jin Kadlec , Milan Tichy , Miroslav Lfcko 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Our poster introduces an innovative RLS Lattice filter implementation 
for FPGAs. The signal processing applications typically require wide 
numeric range, and that poses a problem when using an FPGA 
implementation. Our approach is based on arithmetic using 



3 of 7 



5/7/03 10:17AM 



http://portal.acm.org/resultsxfm?coll^orta^dl=ACM&CFID=3472318&CFTOKEN=33008695 



logarithmic numeric representation (LNS). The test application - an 
adaptive noise canceller - has been optimized for the Xilinx Virtex 
devices. It consumes roughly 70% of all logic resources of the 
XCV800 device and all block memory cells. The ... 

10 Poster session: An FPGA architecture with built-in error correction 77% 
S) capability 

P. K. Lala , B. Kiran Kumar 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

The use of very deep submicron technology makes VLSI-based digital 
systems more susceptible to transient or soft errors, and thus 
compromises their reliability. This paper proposes an FPGA 
architecture inspired by the human immune system that allows 
tolerance of transient errors. The architecture is composed of a 
two-dimensional array of identical functional cells with different 
genetic codes. These codes are chosen based on the required 
functions to be performed by the functional cells. An erro ... 

11 Poster session: Synthetic circuit generation using clustering and 77% 
Si iteration 

Paul D. Kundarewich , Jonathan Rose 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

The development of next-generation CAD tools and FPGA 
architectures requires benchmark circuits to experiment with new 
algorithms and architectures. There has always been a shortage of 
good public benchmarks for these purposes, and even companies 
that have access to proprietary customer designs could benefit from 

. designs that meet size and other particular specifications; In this 
paper, we present a new method of generating realistic synthetic 
benchmark circuits to help alleviate this shortage. ... 

12 Poster session: Reconfigurable randomized K-way graph 77% 
@) partitioning 

Fatih Kocan 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

In this paper, a randomized k-way graph partitioning algorithm is 
mapped onto reconfigurable hardware. The randomized algorithm 
relies on repetitive running of the same algorithm with different 
random number sequences to achieve the (near-)optimal solution. 
The run-time and hardware requirements of this reconfigurable 
solution per a random number sequence are 0(| V|-K) cycles and 
0(|V|log|V| + |E|) gates and flip-flops, respectively. Performance is 
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improved further at the expense of more hardware b ... 

13 Poster session: An automated and power-aware framework for 77% 

13 utilization of IP cores in hardware generated from C descriptions 
targeting FPGAs 

Alex Jones , Prith Banerjee 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Use of hand optimized Intellectual Property (IP) logic cores is prolific 
in hardware design. While IP cores remain a standard way to utilize 
the improvement in FPGA technology and contend with time to 
market pressure through reuse, popularity of tools generating 
hardware descriptions from high-level languages is also increasing in 
popularity. PACT HDL combines these two methods within a 
power-aware framework. The PACT HDL compiler generates power 
optimized VHDL/Verilog from a C language descript ... 

14 Poster session: Power-aware architectures and circuits for 77% 
ED FPGA-based signal processing 

Frank Honore , Ben Calhoun , Anantha Chandrakasan 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

This work showcases a power-aware system design methodology for 
DSP applications on reconfigurable hardware platforms. In particular, 
an enhanced FPGA architecture is proposed and analyzed for a deep 
submicron process technology. These enhancements reduce 
Configurable Logic Block (CLB) usage for distributed arithmetic 
implementations of signal processing applications by 50% or more 
thereby reducing the load on interconnect resources. Multi-Threshold 
CMOS (MTGMOS) circuit design techniques are ag ... 

15 Poster session: FPGAs in critical hardware/software systems 77% 
3) Adrian J. Hilton J. Adrian J. Hilton , Gemma Townson , Jon G. Hall 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

FPGAs are being used in increasingly complex roles in critical 
systems, interacting with conventional critical software. Established 
safety standards require rigorous justification of safety and 
correctness of the conventional software in such systems. Newer 
standards now make similar requirements for safety- related 
electronic hardware, such as FPGAs, in these systems. In this paper 
we examine the current state-of-the-art in programming FPGAs, and 
their use in conventional (low-criticality) hard ... 
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16 Poster session: A SC-based novel configurable analog cell 77% 
31 Binlin Guo , Jiarong Tong 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

This paper presents a high performance Configurable Analog Cell 
(CAC) which is made up of a Basic Configurable Analog Cell (BCAC) 
and a digital converter block. The CAC can be used either for Field 
Programmable Analog Array (FPAA) or for Field Programmable 
Digital-Analog Mixed Array (FPMA). The BCAC include three 
innovative Programmable Switch Blocks (PSBs), three Programmable 
Capacitor Arrays (PCAs), and an amplifier. PSB and PCA can be 
programmed to generate many equivalent components. In addi ... 

17 Poster session: On computation and resource management in an 77% 
0 FPGA-based computation environment 

Soheil Ghiasi , Karlene Nguyen , Elaheh Bozorgzadeh , Majid 
Sarrafzadeh 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

The idea of managing the comprising computations of an application 
executed in an FPGA-based system is presented. An efficient 
algorithm for exploiting the timing slack of building blocks of the 
application is proposed. The slack of these blocks can be utilized by 
replacing them with slower but "cheaper" modules and by assigning 
the computations to the proper resources. Thus, our approach 
manages the comprising computations and system resources at the 
same time. This is performed without comprom ... 

18 Poster session: Testing for bit error rate in FPGA communication 77% 
@) interfaces 

Yongquan Fan , Zeljko Zilic 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

FPGAs have witnessed an increased use of dedicated communication 
interfaces. With their increased use, it is becoming critical to test and 
properly characterize all such interfaces. Bit error rate (BER) 
characteristic is one of the basic measures of the performance of any 
digital communication system. We propose a scheme for BER testing 
in FPGAs, which exhibits a few orders of magnitude speedup 
compared to traditional software simulation methods. In this scheme, 
we include a novel implementation ... 

19 Poster session: On hiding latency in reconfigurable systems: the 77% 
Bl case of merge-sort for an FPGA-based system 

Hossam EIGindy , George Ferizis 
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Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Recursive solutions are effective software techniques that are difficult 
to map into hardware due to their dependency on input size and data 
values. As a result, most high-level design tools do not allow for 
recursive calls. In this paper we present a technique for mapping the 
merge-sort algorithm, as a case study, into a reconfigurable system. 
Our mapping employs an on-line prediction method to reconfigure 
the necessary hardware only when the need arises, and to hide the 
reconfiguration delay. ... 



20 Poster session: Using FPGAs for data and reorganization engines: 77% 
0) preliminary results for spatial pointer-based data structures 
Pedro C. Diniz , Joonseok Park 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

FPGAs have appealing features such as customizable internal and 
external bandwidth and the ability to exploit vast amounts of 
fine-grain instruction-level parallelism. In this paper we explore the 
applicability of these features in using FPGAs as data search and 
reorganization engines for performing search and reorganization 
computations over spatial pointer-based data structures for which 
traditional computing platforms perform poorly. The preliminary 
experiments, for a set of simple spatial qu ... 
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21 Poster session: Recursive circuit clustering for minimum delay and 
13 area 

Mehrdad Eslami Dehkordi , Stephen D. Brown 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

We present an effective recursive algorithm for circuit clustering for 
delay and area minimization, which is applicable to FPGAs. At the 
highest level of clustering, the circuit is clustered using a modified 
single-revel clustering algorithm. A cluster to netlist transformation 
technique is proposed, which converts each cluster into a new 
subcircuit. The algorithm then continues recursively by clustering the 
generated subcircuits into further levels of clusters. To reduce the 
amount of node dupl ... 



77% 



22 Poster session: Track placement: orchestrating routing structures 
31 to maximize mutability 

Katherine Compton , Scott Hauck 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

The design of a routing channel for an FPGA is a complex process, 
requiring the careful balance of flexibility with silicon efficiency. With 
the growing move towards embedding FPGAs into SoC designs, and 
the opportunity to automatically generate FPGA architectures, this 



77% 



1 of 7 



5/7/03 10:17AM 



Results 



http://portal.aciuorg/resultsxfm?quei7=^&dl=ACM&CFID=3472318&CFTOKEN=33008695 



problem becomes even more critical. The design of a routing channel 
requires determining the number of routing tracks, the length of the 
wires in those tracks, and the positioning of the breaks on the tracks. 
This paper focuses o ... 

23 Poster session: Implementation of digital fixed-point 77% 
13 approximations to continuous-time IIR filters 

J. E. Carletta , R. J. Veillette , F. W. Krach , Z. Fang 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

An analytical framework for the implementation of digital infinite 
impulse response filters in fixed-point hardware on FPGAs is 
presented. It presumes that a continuous-time filter with the desired 
response is given. Within the framework, the constant coefficient bit 
widths are determined by accounting for the sensitivity of the filter's 
pole and zero locations with respect to the coefficient perturbations. 
The internal signal bit widths are determined by calculating 
theoretical bounds on the ra ... 

24 Poster session: A high-speed successive erasure BCH decoder 77% 
Si architecture 

Thomas Buerner 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

A new high speed architecture for a BCH successive erasure decoder 
is presented. The Berlekamp-Massey based decoder by Sarwate and 
Shanbhag is extended to handle successive erasures. The critical 
path in the calculation submodules is increased from Tadd+Tmult to 
Tadd+Tmult+Tmux. The proposed architecture is implemented 

• exemplary for a BCH(63,45,7) code with up to two erasures on a 
XILINX Spartan2E300-7. Thus a clock frequency of 95 MHz is 
reached using 47% of the available slices instead of 105 ... 

25 Poster session: Customized regular channel design in FPGAs 77% 
@) Elaheh Bozorgzadeh , Majid Sarrafzadeh 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

In this paper, we study the problem of customized regular 
segmentation design in FPGA routing channels. We propose a 
deterministic algorithm for segmentation design problem in which 
each interval is assigned to only one segment (1-Segmentation). We 
solve the problem of maximum number of incremental track 
assignment of intervals by mincost network flow technique for 
1-Segmentation design. The general K-Segmentation design problem 
can also be solved by some modifications in our algorithm. We have 
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26 Poster session: Design of a fingerprint system using a 77% 
31 hardware/software environment 

Lee Vanderlei Bonato , Rolf Fredi Molz , Joao Carlos Furtado , Marcos 
Flores Ferrao , Fernando G. Moraes 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Processing system of fingerprint are CPU time intensive, being 
normally implemented in software. This paper present a new 
algorithm for fingerprint features localization, that can be easily 
implemented in hardware (system-on-a-chip, FPGA). This algorithm 
is composed by 3 stages, first stage read a fingerprint image 
(255x255pixels, ash tones) and apply a Gaussian Filter, after this, 
apply a absolute difference mask (ADM) for detector the edges in the 
image filtered and the last stage look for fin ... 

27 Poster session: A granularity-based classification model for 77% 
S) systems-on-a-chip 

Stephan Bingemer , Peter Zipf , Manfred Glesner 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Field-programmable logic has become an increasingly important 
technology for the design of digital circuits. One interesting point in 
the field of reconfigurable logic is its classification within the 
implementation space of other technologies. Such a classification 
gains importance if FPGA technology becomes an integral part of 
Systems-on-a-Chip (SoC). The poster discusses an approach to 
classify technologies based on their granularity. Therefore, a new 
distinction into homogeneous and heteroge ... 

28 Poster session: An estimation and exploration methodology from 77% 
3) system-level specifications: application to FPGAs 

Sebastien Bilavarn , Guy Gogniat , Jean Luc Philippe 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Rapid evaluation and design space exploration from early 
specifications are important issues in the design cycle. We propose 
an original area vs. delay estimation methodology that targets 
reconfigurable architectures. Two main steps compose the estimation 
flow: i) structural estimations where architectural solutions are 
defined at the RT level, this step is technological independent and 
performs an automatic design space exploration and ii) physical 
estimations which perform technology mapping t ... 



3 of 7 



5/7/03 10:17AM 



Results 



http://portal.acm.or^resultsx<m?querv=.U3l&dl=ACM&CFID=3472318&CFTOKEN=33008695 



29 Poster session: A single-FPGA implementation of image connected 77% 
3 component labelling 

K. Benkrid , S. Sukhsawas , D. Crookes , S. Belkacemi 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

This paper describes an architecture based on a serial iterative 
algorithm for Image Connected Component Labelling with a hardware 
complexity O(N) for an NxN image. The algorithm iteratively scans 
the input image, performing a recursive non-zero maximum 
neighbourhood operation. A complete forward pass is followed by an 
inverse pass in which the image is scanned in reverse order. The 
process is repeated until no change in the image occurs. The 
algorithm has been coded in Handel C language and tar ... 

30 Poster session: A logic based approach to hardware abstraction 77% 
13 K. Benkrid , S. Belkacemi , D. Crookes 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

This paper presents a novel approach to hardware abstraction based 
on the logic programming language Prolog. This is an attempt to 
satisfy the dual requirement of abstract hardware design and 
hardware efficiency. Central to this approach is a hardware 
description environment called HIDE, which provides more abstract 
hardware descriptions and compositions than are possible in 
traditional hardware description languages such as VHDL or Verilog. 
HIDE enables highly scaleable and parameterised compos ... 

31 Poster session: Design framework for the implementation of the 77% 
13 2-D orthogonal discrete wavelet transform on FPGA 

A. Benkrid , D._Crookes-,-K.-Benkrid 

Proceedings of the international symposium on Field programmable gate 
arrays February 2003 

This paper gives a design framework for the implementation of the 
2-D Orthogonal Discrete Wavelet Transform (DWT) on FPGA. The 
architecture is based on the Pyramid Algorithm Analysis. Our 
architecture spatially maps the multistage filter banks of the DWT 
onto the Xilinx Virtex-E FPGA family. In this paper we propose a 
novel FIR structure to handle the computation along the borders 
using symmetric extension. The paper includes a new detailed 
mathematical approach to determine the architecture's d ... 

32 Poster session: Making area-performance tradeoffs at the high 77% 
S level using the AccelFPGA compiler for FPGAs 

P. Banerjee , V. Saxena , J. Uribe , M. Haldar , A. Nayak , V. Kim , D. 
Bagchi , S. Pal , N. Tripathi , R. Anderson 
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Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

Applications such as digital cell phones, 3G wireless receivers, and 
voice over IP, require DSP functions that are typically mapped onto 
general purpose DSP processors. With the introduction of advanced 
FPGA architectures which provide built-in DSP support such as the 
Xilinx Virtex-II, and the Altera Stratix, a new hardware alternative is 
available for DSP designers. DSP design has traditionally been 
divided into algorithm development and hardware/software 
implementation. The majority of DSP alg ... 

33 Poster session: FPGA implementation of a fast Hadamard 77% 
01 transformer for WCDMA 

Sanat Kamal Bahl , Jim Plusquellic 

Proceedings of the international symposium on Field programmable gate 

arrays February 2003 

In code division multiple access (CDMA) systems the base station 
identifies each user in a cell by unique orthogonal (Walsh) codes. The 
Walsh codes are generated at the transmitter using a 
Walsh-Hadamard function. A Fast Hadamard Transformer (FHT) is 
used at the receiver to decode the transmitted codes. The purpose of 
this study is to design a FHT which utilizes less hardware resources 
as compared to the existing designs and also suggest means for 
reducing the input length of the Walsh sequence. ... 

34 Poster session: FPGA-based design of an evolutionary controller for 77% 
13 collision-free robot navigation 

M. A. H. B. Azhar , K. R. Dimond 

Proceedings of the international symposium on Field programmable gate 

a rrays_ Fe b ru ary 2003 

The employment of field programmable gate arrays (FPGAs) to a 
robot controller is very attractive, since it allows for fast IC 
prototyping and low cost modifications. The speedup is achieved 
because of pipelining and dedicated functions in hardware that are 
customized to the problem. The self learning ability and the adaptive 
nature of an Artificial Neural Network (ANN) makes it a good 
candidate for the control structure of a robot's navigation. An 
evolutionary approach in designing robots can e ... 

35 Special session on on-chip multi-processing: Design experience of 77% 
Bl a chip multiprocessor merlot and expectation to functional 

verification 
Satoshi Matsushita 

Proceedings of the 15th international symposium on System Synthesis 
October 2002 
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We have fabricated a Chip Multiprocessor prototype code-named 
Merlot to proof our novel speculative multithreading architecture. On 
Merlot, multiple threads provide wider issue window beyond ordinal 
instruction level parallel (ILP) processors like superscalar or VLIW. 
With the architecture, we estimate 3.0 times speedup against single 
processing elements (PE) on speech recognition code and IDCT code 
with four PEs. Merlot integrates on-chip devices, PCI interface, and 
SDRAM interfaces. We have en ... 



36 Delivering acceleration: the potential for increased HPC application 77% 
13 performance using reconfigurable logic 

David Caliga , David Peter Barker 

Proceedings of the 2001 ACM/IEEE conference on Supercomputing 

(CDROM) November 2001 

SRC Computers, Inc. has integrated adaptive computing into its 
SRC-6 high-end server, incorporating reconfigurable processors as 
peers to the microprocessors. Performance improvements resulting 
from reconfigurable computing can provide orders of magnitude 
speedups for a wide variety of algorithms. Reconfigurable logic in 
Field Programmable Gate Arrays (FPGAs) has shown great advantage 
to date in special purpose applications and specialty hardware. SRC 
Computers is working to bring this technolog ... 

37 Embedded software automation: from specification to binary: 77% 
@) Retargetable binary utilities 

Maghsoud Abbaspour , Jianwen Zhu 

Proceedings of the 39th conference on Design automation June 2002 
Since software is playing an increasingly important role in 
system-on-chip, retargetable compilation has been an active 
. research area in the last few years. However, the retargetting of 
equally important downstream system tools, such as assemblers, 
linkers and debuggers, has either been ignored, or falls short of 
meeting the requirements of modern programming languages and 
operating systems. In this paper, we present techniques that can 
automatically retarget the GNU binutils tool kit, which con ... 

38 Constructing and exploiting linear schedules with prescribed 77% 
3l parallelism 

Alain Darte , Robert Schreiber , B. Ramakrishna Rau , Frederic Vivien 
ACM Transactions on Design Automation of Electronic Systems 
(TODAES) January 2002 
Volume 7 Issue 1 

We present two new results of importance in code generation for and 
synthesis of synchronously scheduled parallel processor arrays and 
multicluster VLIWs. The first is a new practical method for 
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constructing a linear schedule for the iterations of a loop nest that 
schedules precisely one iteration per cycle on each of a prescribed 
set of processors. While this problem goes back to the era in which 
systolic computation was in vogue, it has defied practical solution 
until now. We provide a closed ... 



39 IP Design and Reuse: High-level automatic pipelining for 77% 
13 sequential circuits 

Maria-Cristina V. Marinescu , Martin Rinard 

Proceedings of the international symposium on Systems synthesis - 

Volume 14 September 2001 

This paper presents a new approach for automatically pipelining 
sequential circuits. The approach repeatedly extracts a computation 
from the critical path, moves it into a new stage, then uses 
speculation to generate a stream of values that keep the pipeline full. 
The newly generated circuit retains enough state to recover from 
incorrect speculations by flushing the incorrect values from the 
pipeline, restoring the correct state, then restarting the 
computation. We also implement two extensions t ... 



40 Architecture Analysis and Automation: Automatic layout of 77% 
3) domain-specific reconfigurable subsystems for system-on-a-chip 
Shawn Phillips , Scott Hauck 

Tenth ACM International Symposium on Field-Programmable Gate 

Arrays February 2002 

When designing SOCs, a unique opportunity exists to generate 
custom FPGA architectures that are specific to the application domain 
in which the device will be used. The inclusion of such a device will 
provide an efficient compromise between the flexibility of software 

- -and the performance-of hardware, while at the same time allowing 
for post-fabrication modification of circuits. To automate the layout of 
reconfigurable subsystems for system-on -a-chip we present 
template reduction, standard cell, ... 
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41 Data and memory optimization techniques for embedded systems 77% 
0 P. R. Panda , F. Catthoor , N. D. Dutt , K. Danckaert , E. Brockmeyer , 

C. Kulkarni , A. Vandercappelle , P. G. Kjeldsberg 

ACM Transactions on Design Automation of Electronic Systems 

(TODAES) April 2001 

Volume 6 Issue 2 

We present a survey of the state-of-the-art techniques used in 
performing data and memory-related optimizations in embedded 
systems. The optimizations are targeted directly or indirectly at the 
memory subsystem, and impact one or more out of three important 
cost metrics: area, performance, and power dissipation of the 
resulting implementation. We first examine architecture-independent 
optimizations in the form of code transoformations. We next cover a 
broad spectrum of optimizati ... 

42 Coarse grain reconfigurable architecture (embedded tutorial) 77% 
ED Reiner Hartenstein 

Proceedings of the conference on Asia South Pacific Design Automation 

Conference January 2001 

The paper gives a brief survey over a decade of R&D on coarse grain 
reconfigurable hardware and related compilation techniques and 
points out its significance to the emerging discipline of reconfigurable 
computing. 
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43 A decade of reconfigurable computing: a visionary retrospective 77% 
3) R. Hartenstein 

Proceedings of the DATE 2001 on Design, automation and test in Europe 
March 2001 



44 How to solve the current memory access and data transfer 77% 
3 bottlenecks: at the processor architecture or at the compiler level 

Francky Catthoor , Nikil D. Dutt , Christoforos E. Kozyrakis 
Proceedings of the conference on Design, automation and test in Europe 
January 2000 

45 Resolution of dynamic memory allocation and pointers for the 77% 

12 behavioral synthesis form C 

Luc Semeria , Koichi Sato , Giovanni De Micheli 

Proceedings of the conference on Design, automation and test in Europe 
January 2000 

46 Designing systems-on-chip using cores 77% 

13 Reinaldo A. Bergamaschi , William R. Lee 

Proceedings of the 37th conference on Design automation June 2000 
Leading-edge systems-on-chip (SoC) being designed today could 
reach 20 Million gates and 0.5 to 1 GHz operating frequency. In order 
to implement such systems, designers are increasingly relying on 
reuse of Intellectual property (IP) blocks. Since IP blocks are 
pre-designed and pre-verified, the designer can concentrate on the 
complete system without having to worry about the correctness or 

performance of the individual components. That is the goal, in . 

theory.-In praetice,-assembling-on~SoCT.~ 

47 HAL: a multi-paradigm approach to automatic data path synthesis 77% 
a P. G. Paulin , J. P. Knight , E. F. Girczyc 

Proceedings of the 23rd ACM/IEEE conference on Design automation 
July 1986 

A novel approach to automatic data path synthesis is presented. This 
approach features innovations in the synthesis process as well as in 
the system implementation. The synthesis process exhibits three new 
features. The first relates to a subtask that performs an expert 
analysis of the input data flow graph and attempts to evenly 
distribute operations requiring similar resources. This is done using a 
novel “load balancing” technique. The second consists 
of a global pr ... 
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48 Design of system interface modules 77% 
3) Jane S. Sun , Robert W. Brodersen 

Proceedings of the 1992 IEEE/ ACM international conference on 
Computer-aided design November 1992 

49 ISDL: an instruction set description language for retargetability 77% 
@) George Hadjiyiannis , Silvina Hanono , Srinivas Devadas 

Proceedings of the 34th annual conference on Design automation 
conference June 1997 

50 An extendable MIPS-I processor kernel in VHDL for 77% 
3l hardware/software co-design 

M. Gschwind , D. Maurer 

Proceedings of the conference with EURO-VHDL'96 and exhibition on 
European Design Automation September 1996 

51 Gate-level test generation for sequential circuits 77% 
01 Kwang-Ting Cheng 

ACM Transactions on Design Automation of Electronic Systems 
(TODAES) October 1996 
Volume 1 Issue 4 

This paper discusses the gate-level automatic test pattern generation 
(ATPG) methods and techniques for sequential circuits. The basic 
concepts, examples, advantages, and limitations of representative 
methods are reviewed in detail. The relationship between gate-level 
sequential circuit ATPG and the partial scan design is also discussed. 

52 Constructing application-specific heterogeneous embedded .77% 

S) architectures" from" custom"HW/SW applications 

Steven Vercauteren , Bill Lin , Hugo De Man 

Proceedings of the 33rd annual conference on Design automation 

conference June 1996 

53 Functional verification methodology of Chameleon processor 77% 
2) Frangoise Casaubieilh , Anthony Mclsaac , Mike Benjamin , Mike Bartley 

, Frangois Pogodalla , Frederic Rocheteau , Mohamed Belhadj , Jeremy 
Eggleton , Gerard Mas , Geoff Barrett , Christian Berthet 
Proceedings of the 33rd annual conference on Design automation 
conference June 1996 

54 Partitioning of VLSI circuits and systems 77% 
S) Frank M. Johannes 

Proceedings of the 33rd annual conference on Design automation 
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55 Hardware-software-codesign of application specific 77% 

13 microcontrollers with the ASM environment 

A. Both , B. Biermann , R. Lerch , Y. Manoli , K. Sievert 
Proceedings of the conference on European design automation 
conference September 1994 
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